AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation

نویسندگان

چکیده

Recently, medical report generation, which aims to automatically generate a long and coherent descriptive paragraph of given image, has received growing research interests. Different from the general image captioning tasks, generation is more challenging for data-driven neural models. This mainly due 1) serious data bias: normal visual regions dominate dataset over abnormal regions, 2) very sequence. To alleviate above two problems, we propose an AlignTransformer framework, includes Align Hierarchical Attention (AHA) Multi-Grained Transformer (MGT) modules: AHA module first predicts disease tags input then learns multi-grained features by hierarchically aligning tags. The acquired disease-grounded can better represent could bias problem; MGT effectively uses framework report. experiments on public IU-Xray MIMIC-CXR datasets show that achieve results competitive with state-of-the-art methods datasets. Moreover, human evaluation conducted professional radiologists further proves effectiveness our approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Categorisation of Web Tags for Delicious

ion that enables us to conclude, directly from the inspection of the user profile, that these users are interested in politics. In this sense, the categorization of tags allows us represent user profiles in a tractable manner, on the basis a reduced set of meaningful categories of interest. Motivated by this, next we proceed to describe the methodology that we followed to categorise the Delicio...

متن کامل

Hierarchical Co-Clustering of Artists and Tags

The user-assigned tag is a growingly important research topic in MIR. Noticing that some tags are more specific versions of others, this paper studies the problem of organizing tags into a hierarchical structure by taking into account the fact that the corresponding artists are organized into a hierarchy based on genre and style. A novel clustering algorithm, Hierarchical Co-clustering Algorith...

متن کامل

diagnostic and developmental potentials of dynamic assessment for writing skill

این پایان نامه بدنبال بررسی کاربرد ارزیابی مستمر در یک محیط یادگیری زبان دوم از طریق طرح چهار سوال تحقیق زیر بود: (1) درک توانایی های فراگیران زمانیکه که از طریق برآورد عملکرد مستقل آنها امکان پذیر نباشد اما در طول جلسات ارزیابی مستمر مشخص شوند; (2) امکان تقویت توانایی های فراگیران از طریق ارزیابی مستمر; (3) سودمندی ارزیابی مستمر در هدایت آموزش فردی به سمتی که به منطقه ی تقریبی رشد افراد حساس ا...

15 صفحه اول

Hierarchical Search for Word Alignment

We present a simple yet powerful hierarchical search algorithm for automatic word alignment. Our algorithm induces a forest of alignments from which we can efficiently extract a ranked k-best list. We score a given alignment within the forest with a flexible, linear discriminative model incorporating hundreds of features, and trained on a relatively small amount of annotated data. We report res...

متن کامل

Alignment of separated patches: multiple location tags

Gaussian and Gabor patches can be accurately localized; however, it is not yet clear which cues (or location tags) the visual system utilizes for localization. To determine the cues used in spatial alignment, we measured and modelled the perceived shifts for asymmetric Gaussian and Gabor patches over a wide range of separations, patch sizes and orientations. For Gaussian patches we observed per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-87199-4_7